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ABSTRACT 

We use the latest observations from the 2dF Galaxy Redshift Survey to fit the conditional luminosity 
function (CLE) formulation of the halo-model for galaxies at z = 0. This fit is then used to test the 
extent of evolution in the halo occupation distribution (HOD) to z = 0.8, by comparing the predicted 
clustering from this CLE to preliminary results from the DEEP2 Redshift Survey. We show that the 
current observations from the DEEP2 Redshift Survey are remarkably consistent with no evolution in 
the CLE from z = to z = 0.8. This result is surprising, in that it suggests that there has been very little 
change in the way galaxies occupy their host dark matter halos over half the age of the Universe. We 
discuss in detail the observational constraints we have adopted and also the various different selection 
effects in each survey and how these impact on the galaxy populations encountered in each survey. 
Subject headings: Galaxies: high-redshift — Cosmology: theory 



I. INTRODUCTION 

Despite impressive advances in many areas over the last 
" decade, galaxy formation remains one of the central un- 

■ solved puzzles in cosmology. While it is now generally ac- 
' cepted that galaxies form within dark matter halos (White 
,: & Rees I1978|l , whose properties we know about with in- 

■ creasing reliability, the details of this process are only 
I poorly understood. With the advent of large-scale surveys 
. of galaxies and ever more sophisticated computer simula- 
tions of structure formation we need to develop a more 
nuanced view of galaxy formation than has been typical 
until now. Key to advances in this subject is a framework 
within which to compare theory and observations in as 
transparent a way as possible. 

While studies of individual galaxies can shed light on 
\ many of the relevant physical processes which shape them, 
, galaxy clustering stands as the most successful route to 
' constraints on galaxy formation models to date. Thus we 
\ desire a framework within which we can interpret the nu- 
merous measurements of galaxy clustering that have accu- 
mulated over the years. 

The 'halo model' provides such a framework. Build- 
ing on the insights of semi-analytic galaxy formation 
(e.g. Kauflfmann, Whi te fc G uiderdoniTM?': Colcct al.TW: 
Somerville & Primack ll999|l and high-resolution hydrody- 
namic simulations including star-formation and feedback 
(Katz, Hernq uist fc Weinberg rn)99> Gardner et al. I^DUTl 
Pearce et al. White, Hernquist & Springel I^HUTl 

Yoshikawa et al. 2001 ) the halo model has been extensively 
developed in recent years, e.g. Jing, Mo fc Borner. ri998l 
Benso n et al . l^nnni Seliak OnOni Peaco ck & S mithOnUni Ma 
& EryOOnni Sc occim arro et al. 120011 White I^UHH Scocci- 
marro & Sheth lMJTI Berlind & Weinberg 2002, Scranton 
l^nU^l Yang Mo fc van den Bosch IM)3l (see Cooray & 
Sheth 120021 for a recent review and references to the ear- 
lier literature). 

The halo model postulates that all galaxies lie in virial- 
ized halos and that the number and type of galaxies in ha- 
los is determined primarily (or entirely) by the halo mass. 



Knowledge of the number of galaxies as a function of halo 
mass, known as the halo occupation distribution (HOD), 
their spatial and velocity distribution within the halos and 
a model for the spatial clustering of dark matter halos is 
sufficient to predict most observables in large-scale struc- 
ture. The halo model provides a new way of thinking about 
galaxy bias which is more physically informative than the 
earlier schemes. It also provides a conceptual division be- 
tween 'galaxy formation' and 'cosmology' in that the latter 
affects the spatial distribution and number of dark mat- 
ter halos, while the former describes the properties and 
number of galaxies which form within the halos. The key 
advantage of having this conceptual separation between 
halo and galaxy properties is that the evolution in the spa- 
tial distribution and number of dark matter halos can be 
well understood through the use of numerical simulations, 
allowing us to exclusively constrain galaxy formation pro- 
cesses with the use of recent large galaxy redshift surveys. 

The recent completion of the 2dF Galaxy Redshift Sur- 
vey (2dFGRS, Colless et aL lM7T|l . together with the on- 
going progress of the Sloan Digital Sky Survey (SDSS, 
Strauss et al. 2002), have heralded a new age of precision 
quantification in the galaxy population in the local (z ^ 0) 
Universe. The sheer size of these data sets enables a par- 
ticularly accurate characterization of the properties of the 
galaxies themselves, together with how these galaxies are 
distributed with respect to each other. These two ingre- 
dients will be key to resolving many outstanding issues in 
galaxy formation and evolution. The DEEP2 Galaxy Red- 
shift Survey (Davis et al. l2002|l will herald a similar degree 
of improvement in our understanding of the galaxies at 
z 1. The goal of this paper is to link these two significant 
advances in our understanding of the galaxy population, 
by performing an initial comparison between high fidelity 
2dEGRS results and preliminary DEEP2 clustering results 
(Coil et al. I2003p within the halo-model framework. 

We use the latest observations from the 2dFGRS to con- 
strain the HOD at z 0, by adopting the conditional lu- 
minosity function (CLE) formalism developed by Yang et 
al. (.2003.) . We make use of a Markov-Chain Monte-Carlo 
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(MCMC; see Gilks, Richarson & Spiegelhalter [TMBjl pro- 
cedure to explore the multi-dimensional parameter space 
of the CLF. At present we consider the cosmology to 
be fixed. Specifically we assume a ACDM model with 
r^ni = 0.3, f^A = 0.7, Hq = lOO/ikms-^Mpc"^ with 
h = 0.7, n^K^ = 0.02, n = 0.95 and erg = 0.9 (close to 
the best fit to the CMB data for this cosmology) . We note 
in passing that the matter power spectrum is insensitive 
to the precise value of Vl^h? assumed and that the normal- 
ization is well fixed by the recent WMAP data (Bennett 
et al. 12003(1 once a specific cosmology is chosen. 

Once we have constrained the CLF at z ^ in this way, 
we proceed to test the extent of evolution in the galaxy for- 
mation processes which are quantified by this function. In 
particular (by assuming a given cosmological model in our 
N-body simulations) we use this CLF to make predictions 
about observable galaxy properties at z ~ 1 - which are 
tested using the recent clustering observed in the DEEP2 
Galaxy Redshift Survey (Coil et al. ^M^ . 

The outline of this paper is as follows: In ^we describe 
the conditional luminosity function formulation of the halo 
model and summarize the main parameters in this model. 
Section O gives a detailed description of all the observa- 
tional constraints we will use in our analysis, and in par- 
ticular describes how we deal with the diS'erent selection 
efi^ects in the two surveys. Our fitting procedure is outlined 
in 21 which also discusses the best-fitting parameters out- 
put by the MCMC. In SjSlwe present our prediction for the 
clustering we would expect to see in the DEEP2 Redshift 
Survey if the CLF hadn't undergone any evolution, and 
then compare this to what is actually observed and we 
present our conclusions in 

2. THE CONDITIONAL LUMINOSITY FUNCTION 

Here we briefly review the conditional luminosity func- 
tion (CLF) formalism of Yang et al. (j^03). Further details 
on the halo model and references to the literature can be 
found in Appendix IXI 

Typically when implementing the halo-model, the HOD 
is assumed to simply yield the probability of having N 
galaxies in a halo of mass M, i.e. P(iV|M), and is hence 
independent of the individual properties of the galaxies 
under consideration (e.g. Seljak|2000 Peacock & Smith 
I2000|l . A large advance in the development of the HOD 
function was the recent work of Yang et al. ( 2003J, who 
focussed attention on the conditional luminosity function, 
$(L|M): the luminosity function of galaxies in halos of 
mass M . The CLF extends the HOD by treating galaxies 
not as indistinguishable objects but as carrying a luminos- 
ity 'label'. This requires an increase in complexity in the 
model, but allows us to include one of the most important 
galaxy properties naturally in our models. An alternative 
formulation would slice the 2D parameter space along the 
other axis and specify N{M\L) separately for bins of dif- 
ferent L. While these are mathematically equivalent, it 
seems more natural within the halo framework to specify 
the distribution of L at fixed M rather than the reverse. 
Guzik & Seljak (2003(1 suggested an alternative way of in- 
cluding luminosity information in the halo model, which 
we shall not pursue. 

With the CLF, combined with the halo mass func- 
tion n(Af), one can reconstruct the luminosity function 



of galaxies: 
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where dn/dM is the mass function and /(ly) is the multi- 
plicity function (see Appendix IXjl . Also the (large-scale, 
linear) bias of galaxies as a function of luminosity can be 
computed within this framework: 



biL) = ly ^L\M)biM)^dM , 
$(L|M) 
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We follow Yang et al. H2003|l and model the conditional 
luminosity function as a Schechter function. 



^{L\M)dL ■ 
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with the three functions: 5, L.^, and Here the tilde 
distinguishes these implicit functions of the halo mass, M, 
from the three parameters of the global luminosity func- 
tion. The full parameterization of CLF is derived based 
on arguments about the total mass-to-light ratio of a halo 
and the characteristic luminosity of a halo. 
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With the pre-factor. 
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introduced to make N^{M) — 1 for small mass halos 
(M ^ min[Mi, M2]). The mass-to-light ratio and the typ- 
ical luminosity are both assumed to have a broken power 
law form, arguments for which are given in Yang, Mo & 
van den Bosch ipnHji . 

For a{M), a simple linear function of log(Af) is adopted. 



i{M) = ai5 + 7;logio(A/i5) 



(8) 



with M15 the halo mass in units of lO^^h ^Mq and ais 



XJ.tArJ.V_/ J.J.J.tArCia J.J.J. HJ-J 

5(Mi5 = 1). 

There is a minor problem caused by this definition of 
a{M) in that the function f{a) becomes divergent when 
a < —1.3. For negative r/ this happens at high mass, where 

Lf,{M) ^ M^^''^ / f{a), leading to an unphysical decrease 
in with increasing mass. The drop in causes a drop 
in N{M, L > L^). Fortunately this only happens for very 
high mass halos which are rare enough that they do not 
affect the statistical properties which concern us here. 

Because the Schechter form is assumed, it can be shown 
that 



(L) (M) = / L^{L\M)dL = $,i,r[S + 2] 



(9) 



3 



allowing to be derived from the above expressions. 

There is an additional parameter Ms which is the mass 
scale where the local L* equals the global L*, i.e. 

L4Ms) = L, (10) 

Satisfying this requirement actually sets {AI/L)q. 

Thus in total there are 8 free parameters in the CLF 
algorithm: ais, rj, Mi, M2, 71, 72, 73, Ms- For a given set 
of parameters one can reconstruct the luminosity function 
and luminosity-dependent bias (semi-) analytically. Also 
the HOD can be computed from 

N{M, L > Lcut) = / HL\M)dL = $,r[a + 1, J^^] . 

(11) 

With the HOD and CLF, we are able to populate an N- 
body simulation with galaxies and compare the luminosity 
function, bias and correlation function with observations 
(see Appendix El- 

3. OBSERVATIONAL CONSTRAINTS 

In this paper we wish to search for evolution in the way 
in which galaxies populate halos. We do this by constrain- 
ing the conditional luminosity function at z = and seeing 
if recent results from DEEP2 allow us to rule out the 'null 
hypothesis' that this CLF is independent of redshift. 

Because the observations from these two surveys, and in 
particular how they relate to the different selection effects 
inherent in each survey, are so key to this undertaking we 
review here in some detail the observational constraints we 
will adopt in our subsequent analysis. 

3.1. The Galaxy Surveys 

The 2dFGRS (CoUess et al. OIMIl has recently been 
completed, having observed redshifts for 230,000 galaxies 
over ~ 2000 sq. degrees on the sky. The galaxies targeted 
have been selected to an extinction corrected 6j < 19.5 
magnitude limit from an updated version of the APM cat- 
alogue (Maddox et al. ll990l Maddox, Efstathiou & Suther- 
land [T79 0). As will be demonstrated, the sheer size of this 
data set allows us to rigorously constrain our models at 
z for later propagation to higher redshifts. 

The DEEP2 Galaxy Redshift Survey has recently begun 
to acquire large numbers of galaxy redshifts, and prelimi- 
nary analyses of the galaxy population and its clustering 
behavior have recently been published (Coil et al. 120031 
Madgwick et al. l2003c)l . This survey comprises objects se- 
lected from CFHT photometry that have been pre-selected 
to have redshifts z > 0.7 (Davis et al. l2002|l . and is selected 
to be a magnitude- limited sample based upon the observed 
Tab magnitudes to r < 24.1. 

It is clear that, because of the different bands adopted 
for the magnitude-based selection of the objects in these 
two surveys (6j in the 2dFGRS and tab in DEEP2), they 
may not in fact probe identical galaxy populations. For 
this reason we begin by comparing the relative mix of dif- 
ferent spectral types in these two surveys before proceed- 
ing to describe in more detail the various observational 
constraints from each. 

3.1.1. Selection effects and spectral types 



To investigate the relative importance of selection effects 
in the two surveys, we adopt here the spectral classifica- 
tions developed for the 2dFGRS and D EEP2 in Madg- 
wick et al. 1(2)02) and Madgwick et al. H2003ci!l respec- 
tively. These classifications, 772dF and t^deep, have been 
derived in an analogous way from a principal components 
analysis (PCA) of the galaxy spectra in each survey and 
can hence be used to compare the relative mix of differ- 
ent types of galaxies in each survey. The classification 
itself provides a continuous parameterization of the spec- 
tral type of a galaxy based upon the strength of nebular 
emission present in its rest-frame optical spectrum. It is 
found that 77 correlates relatively well with galaxy i3-band 
morphology at z (Madgwick |2003j). However, the 
most natural interpretation of rj is in terms of the relative 
amount of star formation occurring in each galaxy (Madg- 
wick et al. 2003a). 

In Fig.^wc compare the distributions of spectral types 
in the two surveys. It is clear from this comparison that 
the DEEP2 survey comprises relatively more galaxies that 
are undergoing recent star-formation (given that it con- 
tains proportionately more galaxies with high 77 values). 
This is in fact not surprising given the different selec- 
tion criteria in each survey. In the case of the 2dFGRS, 
the galaxies have been selected in the observed 6j band 
- which gives preference to galaxies with recent star for- 
mation. However, the observed frame r selection adopted 
for the DEEP2 galaxies (which are typically z ~ 1) in fact 
corresponds to a rest-frame J7-band selection - which is 
even more biased towards selecting galaxies with recent 
episodes of star formation. 

It is clear from these arguments and Fig. ^ that mak- 
ing a direct comparison between the 2dFGRS and DEEP2 
galaxy populations would not give a fair measure of evo- 
lution. However those galaxies in the 2dFGRS with more 



i.a 




Matched spectral classification, r] 



Fig. 1. — The distribution of different types of galaxies in the 
2dFGRS and DEEP2 are compared using a 'matched' spectral clas- 
sification (see Madgwick et al. .2003c for details). The dotted his- 
togram shows the distribution of galaxy types from the 2dFGRS 
and the solid histogram shows that from the DEEP2 Survey. From 
this comparison we can conclude that the DEEP2 Survey contains 
a proportionately larger sample of so-called 'late-type' galaxies. For 
this reason in what follows we compare the results from the DEEP2 
Survey to those of the late-type 2dFGRS galaxies — as well as for 
the full 2dFGRS sample. 
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recent star formation may be a more analogous galaxy 
population. 

In the analysis that follows we perform our fits for both 
the full 2dFGRS galaxy population (referred to as 'all' 
galaxies) and for just those galaxies with recent star for- 
mation (the 'late- type' galaxies). In so doing we can ef- 
fectively bracket the galaxy population that is actually 
observed in the DEEP2 Redshift Survey^. 

3.1.2. Galaxy clustering 

A key element of the analysis presented here is a detailed 
characterization of the galaxy clustering at both z = and 
z ^ 1. At z = we have several observational constraints 
on the two-point correlation function, ^(r), available to us 
(e.g. Norbe rg et al. I^UUTI Norberg et al. I^UU^ Hawkins 
et al. OnnSl Madgwick et al. I2003b|l . Because the actual 
type of galaxies under consideration appears to be so im- 
portant to making a fair comparison between the 2dFGRS 
and DEEP2, we choose to adopt the ^(r) estimates from 
Madgwick et al. H2003b|l . since these have been performed 
for both 'early' and 'late-type' galaxies. 

In terms of quantifying the degree of clustering at z ^ 1 
we have a much more limited selection. Coil et al. (2Q03) 
have recently presented initial estimates for the correla- 
tion function in the DEEP2 Redshift Survey for a variety 
of different samples. Because this survey occupies such a 
large range of redshift (0.7 < z < 1.4), we choose to adopt 
the constraints that have been placed on the correlation 
function over the more limited range 0.7 < z < 0.9, so 
that we can neglect evolution within this sample. Specifi- 
cally we shall assume that ^(r) is measured at the effective 
redshift of this sample, as calculated by Coil et al. p005jl . 
Zcff = 0.8. We also emphasize that the analysis presented 
by Coil et al. H2003fl made use of only 2219 galaxies drawn 
from a contiguous sub-region of the initial observations, 
and hence only represents ~ 5% of the total potential of 
the survey. We expect that constraints on the clustering 
of galaxies at z ~ 1 will improve dramatically in the near 
future. 

Note that preliminary estimates of the correlation func- 
tion have also been made for the types of galaxies in this 
survey analogous to the 2dFGRS, which would allow us to 
make much more detailed comparisons in which selection 
effects would be less important. However, given the small 
size of the samples available at this early stage no mean- 
ingful constraints can be obtained from these calculations. 

3.1.3. The luminosity function 

The LF provides a fundamental constraint in that it 
allows us to restrict the number density of galaxies at dif- 
ferent epochs. We use here the galaxy LF estimates made 
by Madgwick et al. (20021 , since these have been deter- 
mined for different sub-samples of galaxy types, which are 
applicable to the arguments of Sec. 13.1.11 This LF has 
been fit as a series of step functions (the SWML fit, see 
Efstathiou, Ellis & Peterson 1988). 

An important consideration in fitting to the LF is the 
fact that the normalization, of the LF has been de- 
termined independently of its shape. The normalization 

^Note that as more data becomes available in the DEEP2 Survey, 
a much more detailed analysis will become possible, in that we will 
be able to directly compare observations based upon different types 
of galaxies in this survey and the 2dFGRS. 



of the LF therefore provides a relatively independent con- 
straint on (f){L\M) and will be treated as such in our anal- 
ysis. 

3.1.4. Absolute bias 

The absolute bias, b{L^), of galaxies in the 2dFGRS has 
been determined by several authors over a range of differ- 
ent scales (Verde et al. 120021 Lahav et al. 120021 Hawkins 
et al. I2003f) . For our purposes we are most interested in 
the large-scale (linear) bias regime - which is most read- 
ily accessible to the analytic model used in the HOD. For 
this reason we do not adopt the bispectrum analysis pre- 
sented in Verde et al. H2002|l . which probes significantly 
non-linear scales. The determination of b made by Lahav 
et al. 2002 and Hawkins et al. (|2003) are in the fully linear 
and quasi-linear regimes respectively, with the Lahav et 
al. estimate corresponding to the largest scales. However, 
we choose to adopt the redshift-distortion based determi- 
nation of b from Hawkins et al. H2003|l . since this has also 
been determined for different types of galaxies (Madgwick 
et al. '2003 b') , allowing us to easily extend our analysis to 
incorporate only the late-type galaxies we expect to dom- 
inate in the DEEP2 Redshift Survey. 

Given the numerous uncertainties involved in determin- 
ing (b) we choose a conservative value 6(i») = 1 ± 0.2. 
As it will turn out the fit already prefers certain values of 
b{Lf), our loose constraint will not be too critical. 

3.1.5. Relative bias 

The observed bias is known to vary depending on sev- 
eral aspects of the galaxy population under consideration. 
Arguably one of the most fundamental variations is the 
change in relative bias with the intrinsic luminosity of 
the galaxy population under consideration. Norberg et 
al. H2001|) measured the bias as a function of luminosity, 
using the clustering of L, galaxies as a reference point. 
They found the points can be fitted by a linear relation: 

6(L,z = 0)/fe(i„z = 0) = 0.85 + 0. 15(i/L,) . (12) 

Norberg et al. H2002|l found it is the luminosity, not the 
type, that is the dominant factor causing the variation in 
the clustering strength. Also, Madgwick et al. (2 003b|l 
show that, at large scale, early and late type galaxies have 
almost the same clustering strength. We will include the 
luminosity dependence of b, but neglect the type depen- 
dence in the following. 

4. PARAMETER FITTING 

We have in total 9 free parameters that need to be con- 
strained using the results we have listed from the 2dFGRS. 
For this reason a simple grid-based search of the parameter 
space is not computationally feasible. Rather we choose 
to adopt a Markov Chain Monte Carlo (MCMC) method 
to explore the multi-dimensional space. This has signif- 
icant computational advantages over a grid-based search 
and in particular it allows us to more efficiently probe the 
maximum in the likelihood space. 

For every random walk step used by the MCMC, we 
evaluate its using; 

1. the observed luminosity function - 22 normalized 
SWML points between 0.05L* and 5L*; 
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2. the relative bias points in 6 luminosity bins and a 
rough estimate of the absolute bias 

3. the four priors we have added f ii4.1|l . 

We will additionally incorporate constraints from the ob- 
served correlation functions after the MCMC has been 
completed using an 'importance re-sampling' of the chain, 
as described later f H4.2l) . 

Note that because the normalization and the shape 
of the luminosity function provide relatively independent 
constraints, we want to treat them separately. So, we fix 
the normalization of the analytically computed points by 
the SWML luminosity bin nearest to L,, i.e. 



(13) 



We then sum the of the remaining A^; — 1 computed 
0(Li) points using SWML points and error bars. This 
way we get the of the shape of the luminosity function. 
For the normalization, it's not determined as accurate as 
the shape. We set $(L*) to have a fractional error of 10% 

around the observed value <1>(L*). 

For the bias, since the observational results are given in 
relative bias already, we do not need to do the same thing 
as we did for luminosity function. However, we do need to 
incorporate the of the absolute bias. 

The is evaluated through the following equation. 



2 2,2,2 

priors ' 



(14) 



where the first two terms are; 
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Here $(ii) and h are observed quantities and all symbols 
without a 'hat' are analytically computed quantities. We 
give the shape of the luminosity function a relative weight 
3 since it is measured with much less uncertainty than 
other constraints we used. As it turns out the constraint 
b{Lf,) = 1 ± 0.2 is not really needed as the fit will provide 
a tighter constraint. 

The chain is started from the model used in Yang, Mo 
& van den Bosch H2003|l and is then allowed a 'burn-in' 
period for the chain to equilibrate in the likelihood space. 
The cosmology used in the code is the same as in the 
simulation. At any point in the chain we generate a new 
trial element by drawing parameter shifts from indepen- 
dent Gaussian distributions in each of the nine CLF pa- 
rameters. The probability of accepting a new random walk 
step is taken to be 



P. 



accept 



exp [-(x^ 



1.0 

2 

new 



xL)/2] 



for x^ 
for X 



new ^ Xold 

2 > v2 
new Aold 



(17) 



Given the sampling strategy we adopted, the acceptance 
rate is around 25%, leading to nearly independent sam- 
ples in the chain after a (few) thousand elements. We 
ran four chains, with different random number seeds, each 
for 100,000 steps. The x^ does not decrease very much 
from the initial guess, indicating that the starting model 
is within the equilibrium region of the chain. 

4.1. Priors 

The CLF formalism has a 'free' function for halos of ev- 
ery mass. Even given the restriction to Schcchter forms 
this requires us to choose three free functions of mass (S, 
and $) to specify the model, and further parameter- 
izing these functions as (double) power laws still allows a 
great deal of freedom. 

Unfortunately, we have found that it is possible to ob- 
tain reasonable <&(L) and h{L) for 'pathological' N{M, L > 
Lent)- For example, the number of galaxies brighter than 
the local L*(Af) in a halo of mass M is proportional to 
j\/73-72^ If 72 becomes larger than 73 and L*(M) does 
not increase fast enough to compensate, the number of 
galaxies brighter than (the global) in a halo of mass M 
will decrease for M ^ max [Mi, M2]- 

To eliminate these and other instabilities we have chosen 
to apply some priors on N{M). In most cases the patho- 
logical models would be rejected by comparing the pre- 
dicted and observed correlation functions (see M.2|l . how- 
ever we include those constraints after the chain has been 
run. In this sense one can consider our priors as a way to 
increase the efficiency of our procedure. 

After extensive experimentation we found that the fol- 
lowing additional priors worked quite well in our analysis. 
First we apply a Gaussian prior that ai5 = —1.0 ± 0.2 
to ensure that only negative values of ais are adopted by 
the chain. Without this prior positive values of ai5 occur 
quite regularly, despite being unphysical. In particular, we 
have based this prior upon composite observations of the 
Coma cluster LF by Driver & De Propris H2002|l , who have 
derived a faint-end slope of a = — LOllg gj, for this clus- 
ter. We allow more flexibility in our constraint than this, 
to allow for the fact that this is only a rough constraint 
on our model. 

We also provide two priors on the shape of N{M), both 
log-normal. The first sets N{10^^ /i-^M©) to a fiducial 
value with a factor of 5 (Iti) error and the second provides 
a similar constraint on N{10^^ H^^Mq). After the impor- 
tance resampling described in t|4.2l the distributions of A^i5 
and N12 are both much narrower than the imposed priors, 
indicating that the exact form is not important. 

Our last prior provides a constraint on the number of 
galaxies residing in halos of very low mass. This prior 
has been included for both theoretical and practical rea- 
sons and again the distribution in the chain after impor- 
tance resampling f ti4.2|) is much narrower than the prior. 
Theoretically we believe that even low luminosity galax- 
ies do not live in very low mass halos, M <C 10^° h~^MQ, 
and we wish to encode this information in the fit. The 
practical reason is that our N-body simulation has limited 
mass resolution which makes it impossible for us to com- 
pute ^(r) for such models. Mathematically the problem- 
atic models arise when L.^, does not decline fast enough 
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with mass. In this situation many of the fainter galax- 
ies can reside in halos of mass smaller than 10^*^ H^^Mq. 
Even though N{M, > O.IL,) is in the range 10"'' to 10"^ 
for these halos, their huge number ensures a significant 
contribution to the total number of galaxies. We require 
L*(M = IO^^H-^Mq) < 0.02L* so that almost no O.li* 
galaxies live in halos with M < 10^" h~^MQ. 

4.2. Importance re-sampling 

The MCMC calculated so far has adopted only the con- 
straints from the 2dFGRS LF and biasing results to con- 
strain the CLE at 2; ~ 0. In fact there is substantially 
more information available to us in the form of very accu- 
rate determinations of the two-point correlation function, 

It is possible to directly relate the CLE to the correla- 
tion function analytically, by making various assumptions 
about the way halos and galaxies arc distributed relative to 
each other (e.g. Peac ock fc Smith . 2000,: Seljak 2000: Mag- 
gliochetti & Porciani l2UU3l Zehavi et al. [2003). However, 
given that we have detailed N-body simulations available 
to us, we instead choose to populate these using the CLE 
and then calculate ^(r) estimates from these. 

Using the N-body simulations instead of analytic ap- 
proximations to the halo profiles and clustering allows us 
to make much more detailed comparisons to the shape of 
the correlation functions, which is particularly appealing 
given the very high fidelity of these observations. 

One immediate drawback of calculating the correlation 
function from the N-body simulations (instead of purely 
analytically), is the additional computational time re- 
quired. Given that the MCMC has yielded a chain of 
400,000 elements, the calculation of f (r) for each of these 
is infeasible. Eortunately it is also unnecessary, as the dis- 
tribution of ^(r) can be obtained from a smaller set of 
samples. We choose 200 models, (given that only one in 
every few thousand elements is completely independent), 
at random, from the chain and compute ^(r) for each of 
these, for the 'all galaxy' and 'late- type' samples at z = 0.1 
and z = 0.8 (the effective redshifts of the 2dEGRS and 
DEEP2 ^(r) calculations). We then adopt an 'importance 
re-sampling' technique (Gilks, Richarson & Spiegelhalter 
Il(j96|l to include the goodness-of-fit between the models 
and the observed 2dEGRS correlation function. We only 
fit ^(r) over the range 0.2 < r < 3 h'^Mpc, since for 
separations r < 200 /i~^kpc fibre collisions lead to a de- 
crease in the observed 2dEGRS ^(r) (see e.g. Hawkins et 
al. 20Q, and for r > 3 /i~^Mpc our N-body simulations 
become susceptible to finite box size effects. 

4.3. Results 

Once the importance re-sampling is incorporated then 
all the necessary z observational constraints have been 
included in our CLE determination and the parameter 
space can be reviewed. In particular, we compare in the 
left panel of Eig. |21the mean correlation functions derived 
from our fitted CLEs and N-body simulations to their cor- 
responding observed ^(r) from the 2dEGRS. It can be seen 
from this figure that the models have very successfully fit- 
ted both the correlation function of the 'late-type' and 'all 
galaxy' 2dEGRS samples. 

The mean HODs derived from our model CLEs are 
shown in Eig. |51 In this particular plot we show the mean 



number of i > O.IL, galaxies occupying a halo of the 
specified mass. This particular luminosity cut is the one 
we have adopted throughout this analysis, as it is found 
to correspond well to the observed distributions of lumi- 
nosities in the two surveys. 

It can be seen from Eig. |3] that the two CLEs predict 
very similar low- mass cut-offs in the HOD, but have very 
different behavior at high masses. Clearly star forming 
galaxies appear to dominate the galaxies within low mass 
halos, but become successively more under-represented in 
the highest mass halos. This result is certainly not sur- 
prising given observed morphology-density relations (e.g. 
Dressier Il980f) , however it is interesting to see the effect 
quantified in this way. 

A different way of plotting the CLE is to instead relate 
the probability that a galaxy within a specified range of 
luminosities inhabits a dark matter halo of a given mass. 
This is also shown in Eig. |2| for both the 'all galaxy' and 
'late-type' samples. A similar plot appeared in Yang et 
al. 1)2003(1 for a single representative model, our procedure 
allows us to also include the uncertainty in the model pa- 
rameters in a statistically correct manner. 

5. CONSTRAINING GALAXY EVOLUTION 

Our preliminary analysis is complete, in that we have 
successfully constrained the CLE formulation of the halo- 
model to the observations of the 2dEGRS. In so doing we 
have had to make only the most basic prior assumptions 
suggesting that the observational data is more than ade- 
quate to tightly constrain this model. 

The next step in our analysis is to determine the degree 
of inference we can make as to the evolution in the galaxy 
population to redshift z ^ 0.8, by contrasting the predic- 
tions of this constrained model to the preliminary results 
from the DEEP2 Redshift Survey. 

5.1. Reconstructing the DEEP2 ^(r) 

As discussed previously, the DEEP2 correlation func- 
tion, ^(r), has been determined in Coil et al. H2003|l . and 
it is this estimate that we now try to recover from our 
CLE models. In particular, we use the ^(r) estimated over 
the redshift interval 0.7 < z < 0.9 which has an effective 
redshift Zcff = 0.8. 

The simplest assumption we can make in propagating 
our model to higher redshifts, is that the CLE has not 
changed between z = 0.1 and z — 0.8. In order to test 
this assumption we again turn to our N-body simulations, 
which allow us to extrapolate the dark matter halo dis- 
tribution to z = 0.8. these simulations can then be re- 
populated and the correlation functions recalculated. 

The results of this analysis are presented in the right 
panel of Eig. [3 where the mean N-body ^(r) is shown for 
both the 'late-type' and 'all galaxy' CLEs (after impor- 
tance re-sampling with respect to the 2dEGRS correlation 
functions at z = 0.1), propagated to redshift z = 0.8. 
The fractional uncertainty is 20%, independent of scale. 
What is remarkable from this figure is the fact that the 
predicted correlation function at z = 0.8 from our fitted 
CLEs is completely consistent with that which has been 
observed in the DEEP2 Survey - especially for the late- 
type 2dEGRS galaxy sample which probably corresponds 
more closely to the galaxies observed in the DEEP 2 Red- 
shift Survey (see i)3.1.1|l . The agreement could be even 
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Fig. 2. — The left panel shows the mean correlation functions after performing importance re-sampling on the model CLFs fit to the 
2dFGRS for 'late-type' (dashed line) and 'all' (solid line) galaxies. It can be seen that our models can recover the observed ^(r) in great 
detail. In each plot, the dotted vertical line at r = 3 h^^ Mpc shows the maximum separation over which our l;{r) estimates from the N-body 
simulations arc robust. The right panel shows the correlation function calculated using the DEEP2 Rcdshift Survey (points), and compares 
this with the mean correlation functions, derived from the same model CLFs used in the left panel (after extrapolating the N-body simulations 
to z = 0.8). The agreement between the mean model CLF {(r)'s at z = 0.8 and the observed DEEP2 ^(r) is very good and quite unexpected. 
This result suggests that we cannot rule out the null-hypothesis of no evolution in the halo model between redshifts 2 = 0.1 and z = 0.8. 




Halo mass, M (Mo/h) M (M,/h) 



Fig. 3. — The mean HOD, N{M), is shown for galaxies with L > O.IL* (thick lines; left panel), as derived from the CLFs after importance 
re-sampling to fit the observed 2dFGRS correlation function. The variance at each mass is indicated by the thinner lines bracketing the 
central one. It can be seen that the late-type galaxies appear to dominate low mass halos, but become steadily more under-represented in 
higher mass halos. The right panel shows how we can invert this to derive the probability that a galaxy with a given range of luminosity 
inhabits a halo of mass, M. This function, P{M\L), is shown here for both O.IL* and L* galaxies, the solid lines again correspond to the 'all 
galaxies' sample and the dashed lines to the 'late-type' sample. 
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better than shown in Fig. [5] if we account for the effects 
of mask making on the DEEP2 data points at small scales 
(< 300 /i~^kpc) and finite field effects at larger scales. 

This result is extremely surprising, in that it appears to 
imply that we cannot rule out an unevolving halo model 
to z f: the galaxies observed in the DEEP2 Redshift 
Survey appear to inhabit their parent dark matter halos 
in the same way as those observed in the 2dFGRS. Had 
the cut-off in N{M, > O.f L*) evolved to higher masses we 
would have seen a steepening in ^(r) on sub-Mpc scales. 
Also, had the iV < 1 tail of 7V(M, > O.f L*) extended over 
a broader mass range at z ~ 1 than z we would have 
seen a suppression of ^(r) on sub-Mpc scales. Similarly, 
steepening the high-Af slope of N{M) increases the bias, 
which is not observed. 

It is difficult to place precise limits on what functional 
forms are allowed because of the limited amount of data, 
the uncertainty in the underlying cosmological model and 
the large dimensional space in which we are working (with 
its associated degeneracies). As a rough gauge of sensi- 
tivity, increases in the mass cut-off by even an order of 
magnitude are strongly disfavored as is steepening N(M) 
to TV oc M. We defer a more thorough analysis until more 
data become available. 

5.2. Other predictions for the DEEP2 Survey 

Given that the CLF does not appear to evolve signifi- 
cantly at high redshifts, it is now possible to make a se- 
ries of predictions relating to what we expect to see in the 
DEEP2 Survey as more data becomes available. We briefly 
describe a handful of these predictions in this section. 

5.2.1. The luminosity function 

The evolution of the LF is perhaps one of the most nat- 
ural outputs of the conditional luminosity function. Given 
that we can accurately trace the dark matter halo mass 
function through N-body simulations, Eqn. ^ provides a 
simple link between the two. 

We have calculated the expected LF at z = 0.8, given 
the constraints placed on the CLF from the 2dFGRS ob- 
servations. Interestingly it shows little evolution - only 
a small variation in the faint-end slope, a, and the char- 
acteristic luminosity, L*, are seen io z = 0.8. This is an 
interesting result which suggests that our hypothesis of a 
non-evolving CLF also implies little evolution in the rela- 
tive distribution of luminosities of galaxies. Note that our 
clustering analysis has been determined entirely in terms 
of relative luminosities (L/L*) so evolution in the absolute 
value of is not constrained. In fact, given that we ex- 
pect the distribution of galaxies to be relatively established 
by z = 0.8 (as would be expected for no evolution in the 
CLF), a purely passive change in would be entirely con- 
sistent with our results. Such a change would be brought 
about by e.g. the internal dimming of the galaxies as their 
stellar populations became successively older and fainter. 
However, our prediction of the distribution of luminosities 
relative to should be robust, and so for this reason we 
predict that little evolution in the faint end slope of the LF 
will be seen in the DEEP2 Survey, when compared to the 
2dFGRS. Once a detailed calculation of the DEEP2 LF 
becomes available we will pursue this possibility in more 
detail. 



5.2.2. Luminosity dependent bias 

The average bias at z ~ has a luminosity dependence 
close to that reported by Norberg et al. (j20QI| , since these 
data were used in the fit. Overall we find that the bias 
is determined by the data to 2% for L <C and 5% for 
L ^ L^,, both much stronger constraints than our prior 
on b. We can then ask how the mean bias relation evolves 
with redshift, again assuming that the CLF is independent 
of z. By z = 0.8 the mean bias has risen 30%, and the 
variance across the chain is between 5-9% depending on 
L. The luminosity dependence has steepened from 0.15 to 
~ 0.25 as shown in Table 

6. CONCLUSIONS 

In this paper we have attempted to constrain the de- 
gree of evolution in the way galaxies inhabit their parent 
dark matter halos by contrasting the very accurate obser- 
vations of the galaxy population in the 2dFGRS (at z ^ 0) 
to preliminary results from the DEEP2 Redshift Survey at 
z — 0.8. In order to do so we have had to carefully consider 
the fidelity of the different observational results available 
for this analysis and how these relate to the different se- 
lection effects in each survey. 

Based upon a comparison between the types of galaxies 
observed in each survey, we were able to conclude that the 
DEEP2 Redshift Survey contains a much larger fraction of 
'late-type' galaxies i.e. those currently under-going signif- 
icant amounts of star formation. For this reason we have 
attempted to not only contrast this survey with the full 
2dFGRS, but also with observational results based upon 
only the 'late-type', star forming galaxies in this survey. 
In this way we believe we are able to make a much more 
fair comparison between the two surveys, in that we can 
effectively 'bracket' the population observed in the DEEP2 
Redshift Survey. Note that as more data becomes avail- 
able in the DEEP2 Survey, this bracketing will no longer 
be necessary as we will be able to directly compare the 
different types of galaxies in each survey. 

A great deal more observational results are available at 
z = than at z ~ 1, and for this reason we have decided 
to adopt the approach of making the most detailed fit pos- 
sible of the HOD at z = 0, and to then test how consistent 
this HOD was to preliminary clustering results at z — 0.8. 
In so doing we were able to show that the current obser- 
vation of the correlation function at z — 0.8 is remarkably 
consistent with no evolution in the HOD to this redshift. 

This result is very surprising. The HOD quantifies 
the impact of galaxy formation and interaction processes 



L/L, 


b{z = 0.1) 


6(z = 0.8) 


0.3 


0.94 ±0.03 


1.13 ±0.05 


0.7 


1.02 ±0.03 


1.26 ±0.05 


1.0 


1.08 ±0.04 


1.34 ±0.06 


1.5 


1.18 ±0.05 


1.48 ±0.09 



Table 1 

The large-scale, linear theory bias at z = and 
z = 0.8 from our importance resampled chains for all 
GALAXIES. The mean and variance defined by the 200 

ELEMENTS IN THE CHAIN ARE REPORTED. 
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present in the galaxy population, and in particular how 
this relates to the underlying dark-matter halo distribu- 
tion. Our results suggest that there has in fact been no 
change in the way galaxies are distributed in their host 
dark-matter halos over approximately half the age of the 
Universe. 

At present our results are preliminary, in that the cor- 
relation function of galaxies at z = 0.8 is still relatively 
poorly constrained. In addition, the absence of an es- 
timated luminosity function for this population makes a 
more detailed comparison of the evolutionary processes 
difficult. However, as the DEEP2 Redshift Survey con- 
tinues to increase its sample size we expect these issues to 
be resolved so that much more conclusive remarks can be 
made. In addition, as the sample size increases, correla- 
tion functions for different types of galaxies in the DEEP2 
Redshift Survey will become available, which will allow us 
to be much more precise about the importance of galaxy 
type selection in our results. 

Other improvements to our analysis will be forthcoming 
when results from the recently begun VLT-VIRMOS Red- 
shift Survey (Le Fevre et al. |I1999) become available. This 
survey comprises an /ab -selected sampling of the galaxy 
population at similar redshifts to the DEEP2 Redshift Sur- 
vey, which at these redshifts is much more comparable to a 
rest-frame _B-selection of galaxies. For this reason the in- 
corporation of this survey into our analysis should allow us 
to more accurately address the issues of sample selection 
and to what degree these impact our conclusions. 

We would like to thank Marc Davis, Brian Gerke and 
Joanne Cohn for helpful discussions, and Alison Coil for 
providing the DEEP2 correlation function in electronic for- 
mat. M.W. would like to thank Ravi Sheth and Andreas 
Berlind, and D.S.M. thanks Shaun Cole for numerous en- 
lightening discussions on the halo model. The simulations 
used here were performed on the IBM-SP at the National 
Energy Research Scientific Computing Center. This re- 
search was supported by the NSF and NASA. 
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APPENDIX 

THE HALO MODEL 

In this appendix we describe the details of the halo 
model formalism that we use in the main paper. A review 
of the halo model, with reference to the original literature, 
can be found in Cooray & Sheth H2UU2|I . 

The halo mass function, dn/dM, describes the (comov- 
ing) number density of halos with mass in the interval 
[M, M + dM] . This function depends on the redshift and 
the power spectrum of dark matter halos as well as the 
underlying cosmology. Typically one adopts a change of 
variables, to give a dimensionless form of this function in 
terms of the halo peak height ly. Specifically, the multi- 
plicity function, f{v), is used since this is independent of 
cosmology. The relation between this multiplicity function 
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and the halo mass function is simply given by, 

4-J{v)dy = 4^dM . (Al) 

where p is the mean matter density of the Universe, and 
the cosmology dependence resides entirely in the peak 
height definition, 

V = [<5e(z)/a(M)]2 . (A2) 

Note that many authors use ly — Sc(z)/<j{M) rather than 
its square as we have done here. Here Sc — 1.868 is a 
threshold parameter taken from the theory of spherical 
top- hat collapse while a-(M) is the rms mass fluctuation 
within spheres of radius R — [M/(47r/3/3)]^/'^ evaluated 
in linear theory, at redshift z. For the multiplicity func- 
tion, /(i^), we use the 'ST' form, motivated by ellipsoidal 
collapse and fit to N-body simulations (Sheth & Tormen 
[Mil : 

lyfii^) = A{1 + ly'-py^/^e-"'/^ , (A3) 

where p — 0.3 and i^' — 0.707j^. The constant A deter- 
mines the normalization and is fixed by the requirement 
that all the mass lie in a given halo. 

We can regain the older Press & Schechter p974fl expres- 
sion by taking v' — v and p — Q. 

Halos within some mass range [M, M -\- dM] are biased 
tracers of the underlying matter distribution. To linear 
order the bias can be computed from the peak-background 
split (Efstathiou et al. [^551 Cole & Kaiser Mo & 

White . 1996,1 which for the ST mass function gives. 

This bias is appropriate for very large scales where the 
bias is expected to be deterministic, linear and scale in- 
dependent. Note that this scheme automatically satisfies 
the requirement that the mean bias of mass is unity: 

(&)mass = / K^)f{^)dv = 1 • (A6) 

Given a HOD function {N) (M) we can use the mass 
function and halo biasing scheme described above to easily 
compute the mean galaxy bias (which is defined as the 
scale independent linear bias of galaxy relative to mass), 

^^^s = f / ^/(^)^(^)^'^ • 

Note that n is the mean density of galaxies, and can be 
computed from, 




GALAXY CATALOGUES 

To compute the correlation function given the CLF 
we make use of a large N-body simulation of a ACDM 
model. The simulation employs 512'^ particles in a pe- 
riodic, cubic box of side 128/i~^Mpc and was run with 
the TreePM code described in White H2UU2|I . The cos- 
mological model was chosen to provide a reasonable fit to 
a wide range of observations with = 0.3, J^a = 0.7, 
Ho = lOO/ikms-^Mpc"^ with h = 0.7, fle/i^ = 0.02, 
n = 0.95 and cts = 0.9 (close to the best fit to the CMB 
data for this cosmology). The gravitational force softening 
was of a spline form, with a "Plummer-equivalent" soft- 
ening length of 9/i~^kpc comoving. The particle mass is 
1.3 X 10^ h~^MQ. The simulation was started at 2: = 100 
and evolved to the present with the full phase space distri- 
bution dumped every 128/i~^Mpc from 2; ~ 3 to z = 0. As 
a check on finite volume effects we compared our results 
to those from a similar simulation in a box with force soft- 
ening and box side twice as large, with 8x more massive 
particles. It is based on this comparison that we restrict 
our fits to r < 3/i~^Mpc. 

We use outputs at redshifts appropriate to the 2dF {z ~ 
0.1) and DEEP2 (z ~ 0.8) surveys. For each output we 
produce a halo catalogue by running a "friends-of-friends" 
group finder (e.g. Davis et al. fT985) with a linking length 
5 = 0.15 (in units of the mean inter-particle spacing). This 
procedure partitions the particles into equivalence classes, 
by linking together all particle pairs separated by less than 
a distance b. We keep all halos with more than 8 particles, 
and consider each of these halos as a candidate for hosting 
'galaxies'. 

We populate the simulation with 'galaxies' by marking 
certain simulation particles and assigning them luminosi- 
ties. The HOD function computed from the CLF model 
gives the mean number of galaxies (more luminous than 
some Lcut) which would be in a halo of mass M. The halo 
mass is estimated as the sum of the masses of the parti- 
cles in the FoF halo, times a small correction factor which 
provides the best fit to the Sheth- Tormen mass function 
(Sheth & Tormen Il999|l . An actual number of galaxies is 
drawn from a distribution for each halo in the simulation, 
and we use the nearby integer distribution. 

Once the number of galaxies in each halo is known, they 
are assigned luminosities from $(Z/|M). The most lumi- 
nous galaxy is assigned to the center of mass of the halo, 
and the other galaxies are assigned to random particles 
within the halo. While the code allows the possibility of 
a radial or velocity bias in assigning galaxies to particles, 
throughout we assumed that galaxies traced the mass and 
velocity distribution of the halo (inheriting its shape and 
any substructure). 

Given the total number of galaxies brighter than Lcut 
in each halo it is necessary to choose luminosities for them 
based on ^{L\M). Just as it was necessary to specify 
both (N) and the higher moments above, it is necessary 
to know the fluctuations about the mean ^{L\M) at this 
stage. If the luminosities of galaxies each halo are drawn 
independently from $(L|M) then one occasionally finds 
relatively low mass halos with two 'bright' galaxies. Since 
such systems have small radii this in turns implies an in- 
crease in the 'bright' galaxy correlation function at small 
scales. Such an increase can indeed be seen in some of the 



semi-analytic models, but appears to be absent in data. 
This suggests that some mechanism acts to suppress pairs 
of bright galaxies in small halos. We can model this in a 
number of ways. On one extreme we could calculate the 
luminosities for all galaxies in halos of similar masses by 
drawing from <i>(L|M) and then distribute them, round- 
robin, in halos in order of decreasing luminosity. This 
ensures that all the bright galaxies are partitioned among 
the halos rather than having pairs end up in any one halo. 
A slightly different approach, which has very similar clus- 
tering properties, was suggested by Yang et al. I|2UU3|I . 
Here we compute Li such that a halo of mass M has (on 
average) only 1 galaxy brighter than Li. We then draw 
luminosities for the galaxies in this halo, allowing only the 
brightest galaxy to have L > Li. This also suppresses the 
higher moments of $(L|M) for bright galaxies. We shall 
follow Yang et al. (2003,) unless stated otherwise. 



